Slavic Corpus and Computational Linguistics
Similar resources
Building Bridges: Slavic Linguistics Going Cognitive
Steven Franks (1996) in his Reflections piece from the Journal of Slavic Linguistics made a comment that “...the walls that divide us [linguists] are coming down all over. What’s next?”, he asked. “Cognitive Science?” I think the answer to this question is a definite yes. The ability to produce and comprehend language is crucial for functioning in our society, and for the past two decades, ling...
Producing a Persian Text Tokenizer Corpus Focusing on Its Computational Linguistics Considerations
The main task of tokenization is to divide the sentences of a text into their constituent units and remove punctuation marks (periods, commas, etc.). Each unit is a continuous lexical or grammatical written sequence that forms an independent semantic unit. Tokenization occurs at the word level, and the extracted units can be used as input to other components such as a stemmer. The requirement to create...
Preliminary Analysis of a Slavic Parallel Corpus
The focus of this paper is on a detailed description of a newly developed parallel corpus of Slavic languages. It consists of 11 Slavic translations of the well-known Russian socialist realist novel “Kak zakaljalas’ stal’/How the steel was tempered” (KZS), written by N.A. Ostrovskij in the years 1932-34. The KZS contains the Slovene, Croatian, Serbian (ekavian), Macedonian, Bulgarian, Ukrainian,...
Journal
Journal title: Journal of Slavic Linguistics
Year: 2017
ISSN: 1543-0391
DOI: 10.1353/jsl.2017.0008